Entropies and combinatorics of random branching processes and context-free languages

نویسندگان

  • Michael I. Miller
  • Joseph A. O'Sullivan
چکیده

AWract-The entropies and combinatorics of trees that branch according to fixed but finite numbers of rules are studied. Context-free grammars are used to categorize the ways in which nodes branch to yield daughter nodes, thus providing an organized setting to examine the entropies for random branching processes whose realizations are trees and whose probabilities are determined by probabilities associated to the substitution rules of the grammar. Normalized entropy rates H are derived for the critical branching rate (p = 1) and supercritical branching rate (p > 1) processes. An equipartition theorem is proven for the supercritical processes proving that L-generation trees normalized by their number of nodes have log probability converging to the entropy rate H with L, almost everywhere in the nonextinction set. A strong departure from classical theorems for Markov sources occurs for super-critical branching processes p > 1 as the typical sets have super-geometric growth rates. Defining the a-typical set of trees to be the L-generation trees with log of their negative log probability within 6 of log p, then the typical set has probability equaling the nonextinction probability and log growth rate of pL. The combinatorics of the set of all trees that can be generated from the context-free substitution rules is also studied. It is proven that for all context-free grammars that are strongly connected and have at least one substitution rule with two daughters or more, the combinatoric growth rate of the set of trees is also supergeometric and equals the largest growth rate of any random branching process with the same substitution rules. Instances of regular, pseudo-linear and context-free grammars are studied for demonstrating the theory, and as a particular example it is shown that the arithmetic expression language has log-number of unique L-generation programs growing at a rate 1.75488L.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Complexity Measures and Cellular

A bstract . Various observables meas urin g the complexi ty of an ensemble of pattern s are discussed , in par ticul ar statistical quantit ies rela ted to the convergence of block entropies, an d comp utation t heoret ical quanti ties related to a gram matical description of t he ensem ble. T hese measures of com plexity are applied to one-dime nsio nal cellular automata, by char acterizing th...

متن کامل

On the Length of the Wadge Hierarchy of Omega-Context Free Languages

We prove in this paper that the length of the Wadge hierarchy of ω-context free languages is greater than the Cantor ordinal εω, which is the ω th fixed point of the ordinal exponentiation of base ω. We show also that there exist some Σ0ω-complete ω-context free languages, improving previous results on ω-context free languages and the Borel hierarchy.

متن کامل

Conjunctive Grammars

This paper introduces a class of formal grammars made up by augmenting the formalism of context-free grammars with an explicit set-theoretic intersection operation. It is shown that conjunctive grammars can generate some important non-contextfree language constructs, including those not in the intersection closure of context-free languages, and that they can provide very succinct descriptions o...

متن کامل

Statistical Properties of Probabilistic Context-Free Grammars

We prove a number of useful results about probabilistic context-free grammars (PCFGs) and their Gibbs representations. We present a method, called the relative weighted frequency method, to assign production probabilities that impose proper PCFG distributions on finite parses. We demonstrate that these distributions have finite entropies. In addition, under the distributions, sizes of parses ha...

متن کامل

Central Limit Theorem in Multitype Branching Random Walk

A discrete time multitype (p-type) branching random walk on the real line R is considered. The positions of the j-type individuals in the n-th generation form a point process. The asymptotic behavior of these point processes, when the generation size tends to infinity, is studied. The central limit theorem is proved.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Information Theory

دوره 38  شماره 

صفحات  -

تاریخ انتشار 1992